Expanding Queries Through Word Sense Disambiguation

نویسندگان

  • José Luis Martínez-Fernández
  • Ana M. García-Serrano
  • Julio Villena-Román
  • Paloma Martínez
چکیده

The use of semantic information in the right way can lead to improved precision and recall figures in Information Retrieval (IR) systems. This assumption is the start point for the work carried out by the MIRACLE research team at ImageCLEF 2006. For this purpose, an implementation of the specification marks Word Sense Disambiguation (WSD) method [4] has been developed. This method is based on WordNet [2] and tries to select the right sense of each word appearing in the query. This allows the inclusion of only the correct synonyms when a semantic expansion is done. This selective expansion method has been combined with a deeper linguistic analysis to interpret negations and filter out common phrases and expressions used in query captions. Results of the application of these techniques to the image retrieval task in CLEF 2006 are also included.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Word Sense Disambiguation for Cross-Language Information Retrieval

We have developed a word sense disambiguation algorithm, following Cheng and Wilensky (1997), to disambiguate among WordNet synsets. This algorithm is to be used in a cross-language information retrieval system, CINDOR, which indexes queries and documents in a language-neutral concept representation based on WordNet synsets. Our goal is to improve retrieval precision through word sense disambig...

متن کامل

Automatic query expansion and word sense disambiguation with long and short queries using WordNet under vector model

This paper describes the experimentation conducted to test the effectiveness of automatic query expansion and word sense disambiguation (WSD) using short and long query of a topic TREC under vector model. We ran different experiments generating queries under vector model using linguistic information extracted from WordNet. Results show that query expansion with short queries and long queries is...

متن کامل

On the Importance of Word Sense Disambiguation for Information Retrieval

Research in information retrieval has led to mixed results about the impact of natural language processing. This paper discusses the importance of word sense disambiguation despite these mixed results. We first discuss some of the factors that can cause apparent inconsistency in retrieval performance with regard to natural language processing: instability of test collection queries, different b...

متن کامل

Topic Level Disambiguation for Weak Queries

Despite limited success, today’s information retrieval (IR) systems are not intelligent or reliable. IR systems return poor search results when users formulate their information needs into incomplete or ambiguous queries (i.e., weak queries). Therefore, one of the main challenges in modern IR research is to provide consistent results across all queries by improving the performance on weak queri...

متن کامل

Evaluating the Contribution of EuroWordNet and Word Sense Disambiguation to Cross-language Information Retrieval

One of the aims of EuroWordNet (EWN) was to provide a resource for Cross-Language Information Retrieval (CLIR). In this paper we present experiments which test the usefulness of EWN for this purpose via a formal evaluation using the Spanish queries from the TREC6 CLIR test set. All CLIR systems using bilingual dictionaries must find a way of dealing with multiple translations and we employ a WS...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006